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ABSTRACT 


Learning  is  defined  to  be  the  computation  done  by  a student  when  there  is  a 
transfer  of  information  to  him  from  a teacher.  In  the  particular  kind  of 
learning  discussed,  the  teacher  names  a source  and  a destination.  In  the 
sentence,  "Robbie  is  like  a fox,"  fox  is  the  source  and  Robbie  is  the  desti- 
nation. The  student,  on  analyzing  the  teacher's  instruction,  computes  a 
kind  of  filter  called  a transfer  frame.  It  stands  between  the  source  and 
the  destination  and  determines  what  information  is  ai lowed  to  pass  from 
one  to  the  other. 

Computing  the  transfer  frame  requires  two  steps:  hypothesis  and 
evaluation.  In  the  hypothesis  step,  potentially  useful  transfer  frames 
are  produced  through  an  analysis  of  the  information  in  the  source  and  its 
immediate  relatives.  For  Robbie,  a robot,  the  way  it  compares  with  other 
robots  would  be  noted.  In  the  evaluation  step,  the  better  of  the  hypo- 
thesized frames  are  selected  through  a study  of  the  destination  frame,  its 
relatives,  and  the  general  context. 

Some  source-destination  pairs  may  be  generated  by  the  student  acting 
alone.  There  is  also  the  possibility  of  making  notes  that  are  useful  in 
deciding  if  conclusion  makes  sense. 


This  report  describes  research  done  at  the  Artificial  Intelligence  Laboratory 
of  the  Massachusetts  Institute  of  Technology.  Support  for  the  laboratory's 
artificial  intelligence  research  is  provided  in  part  by  the  Advanced  Research 
Projects  Agency  of  the  Department  of  Defense  under  Office  of  Naval  Research 
contract  N00014-75-C-0643.  ^ 
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THE  PROBLEM 

Learning  remains  an  enigma.  In  spite  of  .-itrong  efforts  by  good  people,  we  have  barely 
scratched  the  surface.  It  is  hard  to  write  computer  programs  that  can  learn  even  simple 
things.  It  is  even  hard  to  be  precise  about  what  learning  is. 

For  this  paper,  learning  is  defined  to  be  the  computation  done  by  a student  when 
there  is  a transfer  of  information  to  him  from  a teacher.  Normally,  both  must  do  some 
work.  The  amount  of  work  done  by  the  two  participants  in  the  transfer  can  vary 
between  two  extreems,  however.  As  illustiated  in  figure  I,  there  is  a spectrum  starting 
with  learning  by  being  told,  extending  through  learning  by  studying  samples,  and 
ending  with  learning  by  self-sufficient  discovery.  To  some  people,  only  learning  by 
discovery  counts  as  legitimate  learning,  but  such  a posture  seems  extreme. 

This  paper  concentrates  on  middle  ground  in  the  vicinity  of  learning  by  being 
told.  It  offers  a theory  of  learning  by  hypothesizing  and  evaluating  certain  structures 
that  will  be  called  transfer  frames. 

The  methodology.  Since  learning  is  such  a broad,  complex  phenomenon,  it  is  sensible  to 
be  very  precise  about  the  nature  of  the  attack.  This  is  an  adptation  of  the  approach 
used  by  Marr  in  his  fundatmental  work  or  vision  [Marr]: 

■ First,  it  is  necessary  to  observe  c*r  define  some  learning  competance  to  be 
understood. 

■ Second,  a representation  should  be  selected  or  invented  that  is  capable  of 
capturing  the  knowledge  to  be  learned. 

■ Third,  the  first  and  second  items  should  be  translated  into  a precisely  defined 
computation  problem  to  be  solved. 

■ Fourth,  algorithms  should  be  devised  that  preform  the  desired  computation. 

■ And  fifth,  the  results  so  far  should  be  validated  either  by  successful  computer 
implementation  and  experimentation  or  by  appropriate  psychological  inquiries. 

All  this  seems  obvious,  but  there  are  strong  temptations  that  often  throw  research  out  of 
proper  perspective.  One  is  to  be  caught  up  with  an  attraction  to  a particular 
representation.  Worse  yet,  there  may  be  an  attachment  to  some  particular  algorithm, 
with  a corrolary  failure  to  understand  that  many  algorithms  usually  can  be  devised  once 
a computational  problem  is  properly  laid  out. 

Therefore,  let  us  begin  with  a synopsis  concentrating  on  the  definition  of  a kind 
of  learning  competance  and  on  the  selection  of  a representation  that  seems  appropriate 
to  it.  Then  we  will  turn  to  the  details  of  the  algoritms  so  far  devised,  implemented  on  a 
computer,  and  experimented  with. 

Defining  the  competance  to  be  understood.  Consider  the  following  statement: 


Robbie  is  a robot. 


LEARNING  BY  LEARNING  BY  LEARNING  BY  LEARNING  BY 

ROTE  MEMORY  BEING  TOLD  ANALYZING  DISCOVERY 

SAMPLES 


TEACHER  DOES  STUDENT  DOES 

ALL  THE  WORK  ALL  THE  WORK 


Figure  1.  Learning  may  involve  little  work  on  the  part  of  the  learner  or  a lot.  For  there 
to  be  genuine  learning,  most  people  demand  that  the  learner  activiy  participate  in  the 
learning  process,  The  simplest  learning,  really  not  learning  at  all,  is  learning  by  being 
programmed,  with  the  learner  doing  nothing  save  submitting  to  the  program  surgery 
performed  by  the  teacher.  Learner  participation  begins  when  the  learning  is  by  being 
told  or  by  understanding  a series  of  samples.  In  the  extreme,  the  participation  of  the 
learner  is  total,  to  the  exclusion  of  the  teac'ier,  and  there  is  learning  by  self-discovery. 
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Hearing  this,  a human  or  computer  student  should  assume  some  facts  about  Robbie  and 
becomes  curious  about  others.  Robbie  is  probably  made  of  metal,  and  it  would  be 
interesting  to  know  if  he  is  intelligent. 

Now  consider  these: 

Robbie  has  a very  high  degree  of  cleverness. 

Robbie  is  clever. 

Robbie  is  clever  like  a fox. 

Robbie  is  like  a fox. 

All  convey  approximately,  but  not  exactly  the  same  idea.  The  first  is  the  most  precise. 
The  second  differs  very  little  from  it.  The  third  adds  some  nauance  to  the  meaning 
conveyed.  And  the  forth  requires  extra  work  for  understanding  since  cleverness  is  not 
mentioned  explicitely. 

Why  do  we  use  "Robbie  is  like  a fox,"  instead  of  "Robbie  has  a very  high  degree 
of  cleverness?"  It  is  not  just  a form  of  shorthand: 

■ The  teacher  conveys  more  by  using  examples. 

■ The  teacher  cannot  know  how  explicit  he  must  be  since  he  cannot  have  a perfect 
model  of  the  student. 

■ The  teacher  may  be  unable  to  exactly  articulate  what  he  knows.  He  may  have  a 
piece  of  knowlege  without  being  able  to  present  it  according  to  the  particular 
form  required  for  conversational  instruction.  He  may  have  to  allude,  rather  than 
state. 

In  addition,  other  things  happen.  The  student  may  become  curious  about 
whether  Robbie  is  like  a fox  in  other  ways.  Given  that  Robbie  is  clever,  the  student 
may  wonder  if  Robbie  is  also  like  Suzie,  another  robot  already  known  to  be  clever.  Still 
another  possibility  is  that  the  student  may  wonder  if  Robbie  is  like  something  or 
someone  known  to  be  the  quintescence  of  cleverness. 

In  summary,  then,  the  competance  to  be  understood  is  the  competance  to  absorb 
both  explicit  and  simile-like  instruction  and  the  competance  to  be  curious  on  learning 
new  information. 

The  representation.  What  is  to  be  the  representation  selected  to  be  the  target  of 
learning  as  so  circumscribed?  That  is  to  s:»y,  what  conventions  about  symbols  and  their 
arrangement  are  suited  to  capturing  the  knowledge  to  be  learned.  Of  the  many 
representations  available  now,  the  frames  representation  seems  best  suited  in  terms  of 
the  point  of  view  it  encourages. 

Roughly,  a frame  is  a collection  of  properties.  Here,  for  example,  is  a frame 
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describing  a fox: 

FRAME  NAME  SLOT  VALUE 

FOX  A-KIND-OF  SMALL-MAMMAL 

COLOR  RED 

CLEVERNESS  VERY-HIGH 

The  frame  name  identifies  what  is  to  be  described.  Each  of  the  properties  that 
constitute  the  description  is  conveyed  by  a so-called  slot-value  combination.  Each  slot 
name  specifies  a property  and  the  value  associated  with  a slot  dictates  what  is  known 
about  the  corresponding  property  for  the  thing  described  by  the  frame. 

From  a programming  point  of  view,  this  use  of  the  frame  representation  scheme 
is  exactly  like  an  ordinary  property-list  representation.  Indeed,  the  frame  idea  can  be 
defined  as  a generalization  of  the  property  list,  and  the  points  of  generalization,  as 
programming  mechanisms,  are  not  exploited  here.  The  frames  vocabulary  is  used, 
nevertheless,  because  Minsky’s  original  paper  brought  about  a certain  point  of  view 
[Minsky].  This  point  of  view  contributed  strongly  to  the  way  of  looking  at  learning 
offered  here. 

One  ready  objection  is  that  programs  using  a frame  representation  can  learn 
nothing  that  is  not  expressible  in  terms  of  frames.  This  seems  true,  but  not  confining. 
The  world  of  objects,  classes,  and  even  processes  that  can  be  described  in  terms  of 
frames  seems  amply  large  for  useful  learning  research. 

The  computational  problem.  The  key  computational  problem,  therefore,  is  to  fill  frame 
slots  using  information  given  by  a teacher  either  explicitely  or  in  the  form  of  simile-like 
instructions. 

An  algorithm.  Here  is  the  essense  of  an  j.lgorithm,  to  be  described  in  detail  later,  that 
accomplishes  the  computation  required  to  deal  with  simile-like  instruction; 

B The  teacher,  names  a source  frarric  and  a destination  frame.  In  the  sentence, 
Robbie  is  like  a fox,' fox  is  the  source  and  Robbie  is  the  destination.  The  teacher 
may  or  may  not  specify  the  exact  sbts  in  which  the  source  and  destination  have 
the  same  values.  He  may  tell  the  student  that  Robbie  and  a fox  are  alike  with 
respect  to  cleverness  or  he  may  just  say  that  they  are  alike. 

■ The  student,  on  analyzing  the  teacher’s  instruction,  computes  a transfer  frame. 
The  transfer  frame  is  a filter.  It  stands  between  the  source  and  the  destination  as 
in  figure  2,  determining  exactly  what  slot-value  combinations  are  allowed  to  pass 
from  one  to  the  other. 


F 
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Figure  2.  The  basic  idea  behind  learning  using  transfer  frames.  The  teacher  specifies  a 
source  and  a destination  and  possibly  the  slots  that  are  relevant.  The  student  analyzes 
the  source,  the  destination,  and  other  aspects  of  the  situation  to  discover  and  use  a 
transfer  frame. 
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■ Computing  the  transfer  frame  requires  two  steps:  hypothesis  and  evaluation.  In 
the  hypothesis  step,  potentially  useful  transfer  frames  are  produced  through  an 
analysis  of  the  information  in  the  source  frame  and  its  immediate  relatives.  For 
Robbie,  a robot,  the  way  it  compares  with  other  robots  would  be  noted.  For  a 
fox,  other  small  common  forest  mammals  would  be  used.  In  the  evaluation  step, 
the  better  of  the  hypothesized  frames  are  selected  through  a study  of  the 
destination  frame,  its  relatives,  and  the  general  context  that  exists  by  way  of 
previous  instruction. 

This  preview  is  given  only  to  provide  a flavor.  Much  more  will  be  said  about  these 
procedures  as  well  as  others  that  deal  v;ith  justification  of  transfers  and  internal 
generation  of  transfer  possibilities. 

Validation.  The  procedures  described  in  this  paper  have  been  implemented  and  tested 
on  the  examples  to  be  given.  Exceptions  are  clearly  noted.  No  claim  is  made  about 
psychological  validation,  however.  When  the  words  teacher  and  student  are  used,  the 
following  is  to  be  understood: 

■ The  teacher  is  a human  instructor. 

■ The  student  is  an  experimental  collection  of  algorithms  implemented  as  computer 
programs. 

The  programs  are  in  LISP.  Listings  are  available. 

In  a moment,  we  will  look  at  the  details  of  a running  program  that  performs  some 
simple  learning  that  is  in  accord  with  the  points  of  competence  proposed  here.  To  keep 
our  own  knowledge  from  getting  too  much  in  the  way  of  thinking  about  the  ideas,  a 
semantically  deprived  world  is  used  for  tne  explanation.  One  consequence  is  that  we 
too  will  have  to  work  at  understanding  what  is  to  be  learned. 

HYPHOTHESIS  AND  EVALUATION 

If  the  source,  destination,  and  transfer  frame  are  given,  there  is  nothing  left  to  do  but 
rush  the  slot  values  through  the  transfer  frame.  But  since  the  transfer  frame  is  usually 
not  given,  the  learner  must  do  some  work  to  dig  out  the  meaning  and  acquire  new 
knowledge.  It  is  this  active  participation  of  the  listener  that  makes  the  learning 
interesting.  To  illustrate  how  transfer  frames  can  be  hypothesized  and  evaluated,  we 
now  look  at  some  very  simple  examples  f'om  the  blocks  world  shown  in  figure  3 and 
figure  4.  This  world  is  used  specifically  to  make  it  easy  to  construct  examples  that 
illustrate  all  of  the  methods.  Note  that  Figure  4 shows  how  the  concepts  are  linked  by 
AKO  relationships,  short  for  A-KIND-OF.  INSTANCE  is  the  opposite  of  AKO. 

There  Are  Several  Ways  To  Hypothesize  Transfer  Frames 

Transfer  frame  hypothesizing  begins  by  collecting  together  all  of  the  slots  in  the  source 
frame  which  are  filled  with  the  values  VERY-LOW  or  VERY-HIGH.  The  theory  is 
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that  concepts  which  exhibit  properties  to  an  unusual  degree  are  potentially  good  sources 
for  those  properties.  Suppose,  for  examplt,  we  have  the  following  instruction; 

U1  IS  Uke  PI. 

To  understand  this,  the  student  looks  at  the  frame  for  PI; 

PI  AKO  PVFAMID 

HEIGHT  VERY-HIGH 

COLOR  REG 

MATERIAL  WOOD 

Clearly  the  only  slot  with  a VERY-HIGH  value  is  HEIGHT.  This  is  therefore 
transferred  to  U1  using  the  following  transfer  frame; 

TF-HEIGHT  TRANSFER-SLOTS  HEIGHT 

If  UI  had  a known  height  already,  then  the  propose  transfer  frame  would  have 
bee.n  rejected  immediately.  If  the  first  method  fails  to  find  a viable  transfer  frame, 
others  are  tried  until  one  works. 

The  next  method  again  searches  for  important  slots,  but  this  time  on  the  basis  of 
global  knowledge.  Slots  whose  own  descriptive  frames  contain  VERY-HIGH  in  their 
IMPORTANCE  slots  are  deemed  globally  important,  and  they  are  all  collected.  The 
slot  PURPOSE,  for  example,  is  globally  important.  Consequently  the  following  results 
in  learning  that  Ul  is  for  storage. 

UI  IS  like  XL 


Inspection  of  the  Xl  and  PURPOSE  frames  shows  why; 


XI 

AKO 

B0.\ 

COLOR 

GREEN 

MATERIAL 

WOOD 

PURPOSE 

STORAGE 

PURPOSE 

AKO 

FUNCTIONAL-PROPERTY 

IMPORTANCE 

VEF.Y-HIGH 

Having  dispensed  with  slots  filled  with  exceptional  values  and  slots  known  to  be 
globally  important,  the  next  method  concentrates  on  slots  which  are  unusual  for 
concepts  in  the  same  class  as  the  source.  Thus  the  slot  MATERIAL,  found  in  LI.  would 
be  judged  important  there  because  there  ue  three  balls.  LI.  L2.  and  L3,  and  of  these, 
only  LI  has  WOOD  in  the  MATERIAL  slot,  which  for  balls  is  unusual: 


Figure  4.  The  hierarchical  organization  of  the  blocks  world.  The  structure  reflects  how 
concepts  are  linked  by  the  A-KIND-OF  relation. 


OKIENTATION-P 


DIRECTION 

ATTITUDE 


PROPERTY-P 


PHYSICAL-P 


SIZE-P 


COLOR-P 


BEAUTY-P 

MATERIAL-P. 


OVERALL-APPEARANCE-P - 


TOP-APPEARANCE-P' 


SIZE 
WE  I GHT 
WIDTH 
DEPTH 

COLOR 

SATURATION 

BRIGHTNESS 


HOMOGENAITY 

MATERIAL 

DENSITY 

OVERALL-POINTEDNESS 

OVERALL-ROUNDNESS 

TOP-F 

TOP-H 


MENTAL-P 


SUPPORTING-STRENGTH 
INTELLIGENCE-P 


INTELLIGENCE 

COMMON-SENSE 

REASONING-POWER 

CLEVERNESS 

SCHOLARSHIP 

LEARNING-SPEED 

MEMORY 

PLANNING-ABILITY 


INTERACT 


FRIENDLINESS 

VERBOSITY 

COOPERATIVENESS 


ENTHUSIASM-P 


MOTION-P 


SPEED 

GRACE 


MEASURE-VALUE 


VERY-HIGH 

HIGH 

AVERAGE 

LOW 

VERY-LOW 


Figure  5.  The  hierarchical  organization  of  the  properties  used  in  the  examples. 
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MEDIUM 

COLOR 

BLUE 

MATERIAL 

WOOD 

L2 
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COLOR 

RED 

MATERIAL 

RUBBER 

L3 

AKO 

BALL 

SIZE 

MEDIUM 

COLOR 

BLUE 

MATERIAL 
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Consequently  the  following  results  in  knowing  that  U1  has  WOOD  in  the  MATERIAL 
slot; 


U1  is  like  LI. 

Now  suppose  that  we  move  to  U2  and  offer  the  following  information: 

U2  IS  a CYLINDER. 

U2  IS  like  Bl. 

Bl,  unfortunately,  is  rather  undistinguished: 


Bl  AKO  BRICK 

SIZE  NEI'IUM 

COLOR  BLUE 

MATERIAL  WOOD 


Consequently,  none  of  the  hypothesizing  methods  given  so  far  find  anything,  and  the 
learner  must  simply  gather  up  all  the  slots. 

Note  that  after  all  of  the  slots  are  collected,  they  could  be  assembled  together  into 
a single  transfer  frame,  but  it  seems  betier  to  group  them  together  according  to  the 
property  categories  involved.  This  is  true  no  matter  what  hypothesizing  method  is  used 
to  collect  them.  Thus  Bl’s  SIZE,  COLOFt,  and  MATERIAL  slots,  none  of  which  are 
closely  related  as  figure  5 shows,  form  three  corresponding  transfer  frames. 

There  Are  Several  Ways  To  Evaluate  The  Transfer  Frames 

It  is  now  up  to  the  evaluation  methods  to  determine  which  transfer  frame  to  use. 
Several  of  these  methods  examine  relatives  of  the  destination,  looking  carefully  for 
evidence  that  can  pull  the  better  transfer  frames  out  of  the  pack.  Consequently,  it  is 
important  to  know  that  U2  is  a kind  of  cylinder  and  that  Yl  and  Y2  are  too. 
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U2 

AKO 

CYLINDER 

Y1 

AKO 

CYLINDER 

COLOR 

BLLE 

SIZE 

HIGH 

Y2 

AKO 

CYLINDER 

COLOR 

GREEN 

CYLINDER 

AKO 

THING 

INSTANCE 

Y1 

Y2 

UZ 

TYPICAL- 

INSTANCE 

TI-CYLINDER 

The  typical  instance  is  a frame  created  to  describe  how  the  instances  are  alike. 
Following  earlier  work,  typical  instances  ar;  computed  as  follows: 

■ If  a slot-value  combination  appears  in  more  than  some  fraction  of  the  instances, 
Tjy,  put  that  combination  in  the  typical  instance. 

■ If  a slot  appears  in  more  than  some  fraction  of  the  instances,  Tj,  but  is  not  filled 
uniformly  enough  to  pass  the  first  test,  put  it  in  the  typical  instance  without  a 
value. 

At  the  moment,  both  thresholds  are  set  at  6>57..  Hence  for  the  given  cylinders,  the  typical 
instance  is  very  simple: 


TI-CYLINDER 


COLOR 


Thus  the  typical  thing  in  the  cylinder  class  has  some  color.  The  first  transfer  frame 
evaluation  method  exploits  this  information  to  pick  out  the  transfer  frame  with  the 
COLOR  slot  since  the  typical  instance  indicates  that  color  is  a commonly  filled  slot,  one 
that  is  therefore  wanted  in  some  sense  by  the  destination. 

As  of  now,  we  therefore  have  the  following  frames: 


BI 

AKO 

BRICK 

SIZE 

MEDIUM 

COLOR 

BLUE 

MATERIAL 

W0C>D 

U2 

AKO 

CYLINDER 

COLOR 

BLUE 

TRANSFERRED-FROn 


B1 


Note  that  the  COLOR  slot  of  U2  has  the  BLUE  value  augmented  by  a comment 
specifying  where  the  value  came  from.  This  exercises  more  of  the  Goldstein-Roberts 
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frame  system. 

Now  suppose  the  following  is  given  again: 

U2  IS  like  Bl. 

Only  the  slots  SIZE  and  MATERIAL  emerge  because  COLOR  is  already  filled.  These 
form  two  frames,  neither  of  which  is  better  than  the  other  with  respect  to  the  typical 
instance.  Consequently  another,  weaker,  method  is  used.  This  other  method  notices  that 
some  sibling  of  U2  has  a SIZE  slot,  namely  Yl.  On  the  other  hand  no  sibling  has  a 
MATERIAL  slot.  Hence  the  evidence  favors  using  SIZE  since  it  is  more  likely  to  apply 
than  MATERIAL.  Evidently  U2  is  medium  in  size. 

Next,  to  expose  still  another  evaluation  method,  let  us  consider  the  following: 

U3  is  like  Cl. 

Assume  that  nothing  more  is  known  about  U3  and  that  Cl  is  described  as  follows; 


Cl  AKO  CUEE 

SIZE  MEI'IUM 

COLOR  BLUE 

MATERIAL  PLASTIC 


As  in  the  example  using  Bl,  three  frames  are  created,  one  for  SIZE,  COLOR,  and 
MATERIAL.  Now,  however,  there  are  no  known  relatives  of  U3  yet,  so  none  of  the 
previous  evaluation  methods  work.  The  decision,  given  that  the  sequence  is  connected, 
goes  to  the  frame  that  is  most  in  keeping  with  the  context  determined  by  the  last 
transfer.  The  last  transfer  involved  size,  so  this  one  will  too.  Actually  the  context  is 
always  reset  to  be  the  node  in  the  property  tree  just  above  the  last  slot  used. 
Consequently  the  context  established  is  SIZE-P,  as  shown  in  figure  5,  and  anything 
from  the  group  SIZE,  HEIGHT,  WIDTH,  or  WEIGHT  passes. 

This  concludes  the  discussion  of  evaluators.  Certainly  the  implementation  is 
preliminary  and  a lot  of  changes  may  be  found  appropriate.  For  example,  it  is  not  clear 
that  the  dividing  line  between  the  transfer  frame  hypothesizers  and  the  transfer  frame 
evaluators  is  correctly  placed.  The  same  basic  strategy  may  be  useful  either  way. 

Near  Misses  May  Generate  Transfers  As  Well  As  Examples 

Previous  work  stressed  the  idea  of  near  .misses,  samples  which  are  not  like  the  thing 
being  described  in  some  important  way  [Winston].  Using  a near  miss  in  a teaching 
sequence  usually  resulted  in  the  placement  of  some  so-called  emphatic  relationship  such 
as  MUST-BE-AKO  BRICK  or  MUST-NOT-BE  RED.  The  programs  being 
described  do  not  deal  with  this  important  leaching  idea  only  because  the  thrust  is  in  the 
direction  of  dealing  with  new  ideas,  not  because  the  old  ones  have  been  superseded. 
Indeed  it  is  fairly  clear  how  near-miss  action  could  be  incorperated  into  the  current 
system: 


Use  the  same  hypothesis  methods  without  change. 


Winston 


10 


May  3.  1977 


■ Use  the  same  evaluation  methods,  except  that  slots  are  not  to  be  rejected  merely 
because  they  happen  to  be  filled  in  the  destination  frame. 

■ Revise  the  way  the  transfer  frame  is  used  to  carry  slot  information  from  the 
source  to  the  destination.  Instead  of  adding  to  the  VALUE  facet  of  the  slot,  add 
to  the  MUST-BE  facet  or  the  MUST-NOT-BE  facet  instead.  Or,  since  the 
Coldstein-Roberts  frame  system  does  not  have  these  facets,  it  might  be  easier  to 
get  their  effect  by  placing  little  programs  into  the  existing  REQUIRE  facet. 

With  this  it  would  be  possible  to  give  the  following; 

ARCH  IS  not  like  TABLE. 

The  expected  result  would  be  the  placement  of  EAT  and  WRITE  in  the 

MUST-NOT-BE  facet  of  the  PURPOSE  slot  of  ARCH.  This  would  happen  even  if 

ARCH  already  had  something  in  the  VALUE  facet  of  the  PURPOSE  slot. 

Summary 

The  hypothesis  and  evaluation  methods  are  as  follows: 

■ Use  a remembered  transfer  frame.  Examples  of  this  will  be  given  later. 

■ Make  a transfer  frame  using  slots  with  extreme  values. 

■ Make  one  using  slots  that  are  known  to  be  important  in  general. 

■ Look  for  slots  that  are  unique  with  respect  to  the  source's  siblings. 

■ Look  for  slots  that  have  unique  values  in  them  with  respect  to  the  siblings. 

■ Use  all  of  the  source’s  slots. 

Thus  hypothesis  methods  concentrate  on  Icoking  at  the  source  and  its  context. 

■ Weed  out  the  slots  that  are  already  filled  in  the  destination. 

■ Group  the  slots  using  the  property  hierarchy  tree. 

■ Prefer  transfer  frames  that  have  slots  that  are  present  in  the  typical  instance 
associated  with  the  destination. 

■ Prefer  those  that  have  slots  that  some  sibling  of  the  destination  exhibits. 

■ Prefer  those  that  are  relevant  to  the  context  established  by  the  last  transfer. 
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Thus  evaluation  methods  concentrate  on  Icoking  at  the  destination  and  its  context. 
JUSTIFICATION  AND  CURIOSITY 

Once  a transfer  frame  is  in  hand,  then  it  is  necessary  to  decide  if  using  it  really  makes 
sense  given  all  that  is  known  about  the  slots  it  will  effect.  If  it  is  used,  then  there  is  the 
further  question  of  whether  the  new  knowledge  gained  about  the  destination  should 
trigger  the  student  into  further,  internally  generated  speculation. 

There  Are  Several  Ways  To  Justify  The  Transfer  Frames 

Once  transfer  frames  have  been  found  and  ordered  by  hypothesizers  and  evaluators, 
the  next  job  is  to  decide  if  they  are  indeed  legitimate.  There  are  at  least  four  ways  to 
do  this.  They  will  be  described  in  the  order  of  increasing  length  of  explanation. 

First,  of  course,  the  student  can  ask  the  teacher  directly  if  the  frame  is 
appropriate.  Second,  the  student  can  use  :he  restriction  feature  of  the  frame  system  to 
prevent  the  insertion  of  values  that  conflict  with  restriction  knowledge  that  exists  in  the 
AKO  hierarchy.  This  comes  automatically  with  the  Goldstein-Roberts  frame  language. 

Third,  there  is  a method  involving  inspection  of  the  AKO/INSTANCE  path 
between  the  source  and  the  destination.  The  basic  idea  is  that  if  the  frames  on  the 
source-destination  path  do  not  show  a kind  of  distance  with  respect  to  the  slots  and 
values  to  be  transferred,  then  the  transfer  is  judged  to  be  weak.  For  a simple  example, 
suppose  that  the  following  is  given: 

U4  IS  a BRICK. 

U4  IS  like  Bl. 

The  SIZE.  MATERIAL,  and  COLOR  slots  seem  appropriate  for  transfer  given  the 
hypothesizing  and  evaluation  methods  described.  Looking  more  closely,  however,  SIZE 
seems  safest  because  all  bricks  are  of  medium  size.  The  MATERIAL  slot  comes  next 
because  all  but  one  are  made  of  wood,  as  is  Bl,  whereas  there  are  many  values  for 
COLOR.  The  BRICK  node  therefore  has  greatest  admittance  with  respect  to  the 
SIZE-MEDIUM  combination.  Here  is  a crude  formula  for  quantifying  this  notion: 

<admittance  of  node  N> 

• (a  - 4 1)  / (a  I)  for  a > 0 

- 1 for  a > 0 

where  a is  the  number  of  times  a slot  is  filled  in  N and  the  children  of  N and  b is 
the  number  of  times  the  slot  is  filleil  with  a value  different  from  the  one  in  the 
source  frame. 

Note  that  if  a slot  is  always  filled  the  same  way  as  it  is  in  the  source,  b will  be  0 and  the 
admittance  will  be  I.  On  the  other  hand,  if  none  of  the  values  that  appear  are  the  same, 
a - 4 and  the  admittance  is  1/4,  and  this  number  can  get  small.  For  the  bricks  in  this 
example,  the  known  brick  colors  are  blue,  green,  red,  blue,  and  undefined,  giving  an 
admittance  with  respect  to  COLOR-BLUE  of  (4  - 2 + I)  / (4  + I)  - .6.  All  but  one 
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of  the  bricks  are  made  of  wood,  however,  so  the  admittance  with  respect  to 
MATERIAL  is  (5  - 1 + 1)  / (5  + 1)  - .83.  SIZE  wins  because  its  admittance  is  I.  It  is 
safer  to  transfer  the  SIZE-MEDIUM  combination  from  BI  than  it  is  to  transfer 
MATERIAL-WOOD  or  COLOR-RED. 

This  situation  for  which  the  transfer  is  among  siblings  is  particularly  simple 
because  the  admittance  is  just  a function  of  the  parent  and  all  the  siblings.  For  the 
following,  more  must  be  done: 

U-1  is  like  L3. 

Again  SIZE,  MATERIAL,  and  COLOR  are  the  candidate  slots.  To  compare  them,  at 
the  moment,  it  seems  sensible  to  calculate  the  admittance  for  all  nodes  intervening 
between  U4  and  L3,  to  multiply  the  results  together,  and  to  let  that  be  the  so-called  path 
admittance.  For  the  example,  there  are  four  nodes  to  deal  with  as  shown  in  figure  4, 
namely  BRICK,  RECTANGULAR-BLOCK,  BLOCK,  and  BALL.  But  two  of  them 
have  neither  SIZE,  MATERIAL,  nor  COLOR  slots  and  contribute  nothing.  Now 
COLOR-BLUE  wins,  for  its  pcth  admittance  is  .75  « .6  - .45  while  that  for 
material-rubber  is  .75  * .16  - .125  and  that  for  SIZE-SMALL 
IS  .5  * .166  - .083. 

In  point  of  fact,  it  makes  some  ser.se  to  use  not  just  the  slots  in  the  proposed 
transfer  frame,  but  also  the  siblings  of  these  slots,  on  the  ground  that  similar  properties 
tend  to  be  coherent  or  dispersive  together.  Thus  the  admittance  for  a transfer  frame 
with  only  a BRIGHTNESS  slot  causes  a admittance  measurement  with  respect  to 
COLOR,  SATURATION,  and  BRIGHTNESS. 

Now  we  turn  to  a fourth  method  fer  judging  the  quality  of  .a  proposed  transfer, 
one  that  requires  the  student  to  take  notes  on  why  transfers  seem  to  work  and  to  create 
justification  frames  that  can  be  matched  against  a proposed  destination  to  see  if  the 
destination  exhibits  apparently  essential  slot  values. 

Suppose,  for  example,  that  the  student  knows  Cl  has  VERY-HIGH  in  the  slots 
TOP-F  and  TOP-H  Further  suppose  that  the  teacher  gives  this: 

Cl  IS  like  TABLE 

Since  the  teacher  presses  home  the  similarity,  certainly  the  intent  must  be  that  it  is 
possible  to  eat  from  or  write  on  Cl,  just  is  it  is  with  a table,  since  there  are  now  the 
following  fran'es: 


TABLE 

AKO 

STRUCTURE 

PURPOSE 

EAT 

WRITE 

SIZE 

MEC'IUH 

TOP-F 

VERV-HIGH 

TOP-H 

VERY-HIGH 

HAS-PART 

B5 

Y2 
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Cl 


AKO 

SIZE 

COLOR 

MATERIAL 

TOP-F 

TOP-H 


CUEE 

MEDIUM 

BLUE 

PLASTIC 

VERY-HIGH 

VERY-HIGH 


PURPOSE  is  the  only  possible  slot  for  transfer,  but  it  would  have  been  selected  by 
several  hypothesis  methods  anyway.  After  the  transfer,  the  student,  on  the  request  of 
the  teacher,  looks  to  see  how  the  source  and  the  destination  resemble  one  another, 
remembers  the  transfer  frame,  and  constructs  a justification  frame  that  reflects  the 
similarity; 

TABLE  AKO  STRUCTURE 


TF-1 


JF-1 


AKO  TRAJJSFER-FRAME 

TRANSFER-SLOTS  PURPOSE 
TRANSFERRED-FROM 

TAELE 

TRANSFERRED-TO 

TAELE 

JUSTIFICATION-FRAME. 


JF-1 

AKO 

JUETIFICATION-FRAME 

SIZE 

MEDIUM 

TOP-F 

VERY-HIGH 

TOP-H 

VERY-HIGH 

Now  TABLE  has  become  a standard  source  of  particular  values  for  the  PURPOSE 
slot,  namely  EAT  and  WRITE,  through  the  skillful  selection  of  circumstance  by  the 
teacher. 

Henceforward,  a new  hypothesize!'  will  be  the  first  to  work.  It  will  look  for 
values  in  the  TRANSFER-FRAME  slot.  In  this  example,  it  finds  one  for  TABLE, 
namely  TF-I. 

More  importantly  perhaps,  the  student  now  has  a justification  frame  attached  to 
this  standard  transfer  frame.  This  justification  frame  must  be  a subframe  of  a 
proposed  new  destination  if  the  new  destination  is  to  pass.  In  this  example,  for 
PURPOSE  to  be  transferred  to  a destination  from  TABLE,  the  destination  must  have 
the  SIZE,  TOP-F,  and  TOP-H  slot  values  dictated  by  the  justification  frame,  JF-l. 

Consider  this; 


C2  is  like  TABLE. 


There  will  be  a justified  PURPOSE  transfer  if  the  student's  C2  frame  has  the  three 
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key  justification  frame  slots  properly  filled  or  if  the  student  can  get  proper  values  from 
the  teacher  or  f rom  his  own  sensory  apparatus. 

In  general,  this  is  really  only  a mechanism  for  getting  a first  idea  of  why  a given 
transfer  is  justified.  Further  refinement  cf  the  justification  frame  is  possible  by  direct 
telling  or  by  fresh  transfers  to  it  as  a destination. 

While  all  this  student  note  taking  is  going  on,  information  is  also  added  to  the 
TOP-F  and  TOP-H  and  SIZE  frames: 


TOP-F 


AKO  TOf-APPEARAMCE-P 

TRIGGER-VALUE  VERY-HIGH  TRANSFER -SOURCE  TABLE 


TOP-H 


AKO  TOF-APPEARANCE-P 

TRIGGER-VALUE  VERY-HIGH  TRANSFER-SOURCE  TABLE 


SIZE 


AKO  SIZE-P 

TRIGGER-VALUE  MEUIUM  TRANSFER-SOURCE  TABLE 


Of  course  the  TRIGGER- VALUE  slot  for  SIZE  will  become  gorged  far  sooner  than 
for  TOP-F  and  TOP-H  since  SIZE  in  t.  more  common  property.  This  means  that 
SIZE  will  not  be  as  useful  as  the  other  two  with  respect  to  the  use  of  trigger  values 
about  to  be  described. 


Filling  A Slot  May  Induce  Curiosity 

It  IS  reasonable  for  the  student,  having  just  learned  something,  to  make  conjectures 
based  on  the  new  knowledge.  Often  these  conjectures  will  be  wrong  since  they  are 
generated  interally  using  rather  flimsy  heuristic  evidence.  Hence  it  will  be  more 
important  than  usual  to  use  the  various  juitification  methods  to  confirm  the  conjectures. 

To  see  how  other  conjecture  methods  work,  suppose  that  U5  has  the  following 
description: 

U5  AKO  BRICK 

SIZE  5N/.LL 


The  first  conjecture  method  uses  information  placed  when  Justification 
frames  are  made.  Suppose  that  the  following  is  given: 

U5  has  VERY-HIGH  in  the  TOP-F  slot. 

From  this,  and  permission  of  t.ie  teacher  to  think  a bit,  it  is 
reasonable  for  the  student  to  examine  the  TOP-F  frame  for  clues  about 
other  properties  of  U5.  The  TRIGCER-VAIUE  slot  of  TOP-F  contains  the 
value  VERY-HIGH  along  with  a comment  to  the  effect  that  the  value  was 
placed  while  constructing  a justification  frame  involving  a transfer 
from  TABLE.  Since  VERY-HIGH  in  tha  TOP-F  slot  evidently  helped  justify 
a transfer  from  TABLE  in  the  past,  it  is  reasonable  for  the  student  to 
try  a transfer  from  TABLE  to  US  again.  Thankfully  the  trigger  value 
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information  only  exists  if  a justification  frame  also  exists.  The 
student  therefore  has  a justification  frame  that  he  can  use  to  decide  if 
the  transfer  makes  sense,  possibly  asking  the  teacher  some  questions 
along  the  way  about  the  slots  that  the  justification  frame  specifies. 

A second  conjecture  method  uses  siblings.  Suppose  that  the 
following  is  given: 

U5  has  BLUE  in  the  COLOR  slot. 

Using  this,  the  student  may  want  to  look  for  siblings  that  are  also 
blue  with  the  hope  that  US  and  such  a sibling  may  be  alike  in  other 
ways.  Indeed,  this  happens.  Siblings  with  BLUE  in  the  COLOR  slot  are 
collected  and  the  most  typical  one  becomes  a conjectured  source. 

The  most  typical  blue  sibling  is  determined  using  a frame 
similarity  computation  defined  as  follows: 

<frame  similarity  between  X ard  Y> 

« alb  for  b >0 
- 0 for  5 - 0 

where  a is  the  number  of  slot-value  combinations  that  appear  in  both  X and  Y 

and  f»  IS  the  number  of  slot-value  combinations  that  appear  in  either. 

If  all  of  the  slots  in  X and  Y have  diffe.'ent  values,  then  the  frame  similarity  will  be 
zero.  If  all  of  the  slots  in  X and  Y have  the  same  values,  then  it  will  be  one. 

In  this  example,  both  Bl  and  B4  are  blue.  Bl  is  judged  the  more  typical  of  the 
two  because  the  frame  similarity  between  Bl  and  the  typical  instance  frame  for  BRICK, 
TI-BRICK,  IS  .66,  whereas  the  frame  sim.Iarity  between  64  and  TI-BRICK  is  only  .5. 
The  difference  is  the  result  of  a PURPOS-E-SUPPORT  slot-value  combination  present 
in  B4  but  missing  in  Bl  and  TI-BRICK. 

Thus,  learning  that  U5  is  blue  may  result  m a transfer  from  Bl  which  would 
assert  that  U5  is  made  of  wood.  Again,  whatever  justification  methods  arc  available 
should  be  used.  Moreover,  it  would  be  rrore  sensible  to  transfer  only  those  properties 
from  Bl  that  are  closely  related  to  the  COLOR  slot  since  the  conjecturing  method  is  so 
tenuous.  This,  however,  has  not  been  imp'emented  in  the  existing  system. 

Making  these  conjectures  is  one  k:nd  of  "curiosity."  Another  can  come  from 
obvious  reaction  to  learning  that  an  unknown  is  a kind  of  something.  Consider  this: 

U6  IS  a BRICK. 

Without  further  fuss,  it  would  make  sense  for  the  student  to  assume  that  U6  has 
all  the  typical  slot-value  combinations  that  are  in  the  typical  instance  frame,  assuming 
that  U6  IS  of  medium  size  and  is  made  of  wood  as  a result.  But  the  student  should  also 
know  that  typical  instances  may  have  unfilled  slots  that  get  there  when  a slot  is  common 
but  does  not  appear  with  the  same  value  often  enough  for  a value  to  acommpany  it  to 
the  typical  member.  At  the  moment,  the  teacher  is  asked  to  supply  values  for  these  slots 
either  explicitly  or  by  reference  to  a source  with  the  proper  value.  For  the  example 
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given,  then,  the  following  is  printed: 

I assume  U6  has  the  slot-value  pairs 
SIZE-MEDIUM  and 
MATERIAL-WOOD. 

I am  curious  about  a value  for  COLOR. 

If  the  teacher  supplies  no  help  at  all,  it  is  conceivable  that  the  student  still  may 
successfully  fill  the  slot  by  guessing  a suitable  source.  Two  methods  come  to  mind  that 
parallel  the  methods  just  described  for  responding  to  a new  slot  instantiation.  Unlike 
the  other  methods,  neither  have  been  implemented. 

The  first  source  to  consider  is  found  by  again  appealing  to  justification-frame 
information.  The  justification-frame  construction  program  already  places 
JUSTIFIED-V ALUE  slot  information  just  as  it  now  places  TRIGGER-VALUE 
INFORMATION.  For  the  table  example,  we  have  this: 

PURPOSE  JUSTIFIED-VALUE  EAT  TRAMSFERRED-FROM  TABLE 

Wondering  about  the  value  for  the  PURPOSE  slot  of  something,  the  student  can 
use  TABLE  as  a possible  transfer  source  to  be  attempted.  If  there  are  many  justified 
values  in  the  slot's  frame,  then  the  student  might  well  want  to  screen  them  by  looking 
for  the  known  purposes  of  the  siblings  cf  the  destination  frame.  There  is  a greater 
chance  that  the  destination  will  have  a purpose  similar  to  one  of  its  siblings  than  to 
something  entirely  removed. 

The  second  source  to  consider  for  transfer  is  the  most  typical  sibling  of  the 
destination  frame  that  has  the  slot  in  question  filled.  This  would  not  be  done  if  the 
parent’s  admittance  with  respect  to  the  slot  is  low. 

Generalizing,  the  student  could  mo.'e  up  the  AKO  tree,  looking  for  a suitable 
sibling  of  the  more  remote  ancestors,  not  just  the  parent,  until  one  is  found  that  a lot  is 
known  about.  There  will  be  examples  of  this  when  we  discuss  animals.  There,  this  will 
amount  to  trying  a transfer  from  ROBOT,  the  thing  most  is  known  about,  through 
intervening  nodes  to  the  destination.  This  will  feel  good  only  if  the  similarity  of 
ROBOT  and  the  destination  is  high  with  respect  to  the  desired  property.  Similarity 
can  be  measured  by  the  existing  path-admittance  justification  method. 

The  Blocks  World  May  Be  Deceptively  Small 

The  small  number  of  properties  associated  with  each  object  may  be  a cause  for  some 
uneasiness.  Is  it  possible  that  the  examples  work  only  because  of  the  careful 
arrangement  of  the  slots  and  their  sma'I  number?  Maybe.  Indeed  one  important 
question  to  be  addressed  is  that  of  how  much  complexity  can  be  coped  with  before  the 
system  breaks  down.  Meanwhile,  two  points  are  probably  worth  observing: 

■ The  fact  that  things  are  immersed  n an  AKO  tree  will  tend  to  keep  the  clutter 
down.  Presumably  most  property  values  are  obtained  by  defaulting  to  higher  and 
higher  level  concepts. 
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■ Good  teaching  normally  requires  using  examples  with  relatively  few  prominent 
properties.  Good  examples  are  the  ones  for  which  the  computation  required  for 
deciphering  is  low. 

Indeed  the  reason  the  simple  physical  world  is  a good  source  of  general  metaphors, 
some  of  which  reach  the  social  world,  the  mental  world,  and  various  expert  problem 
solving  worlds,  may  be  because  its  simplicity  makes  the  metaphor  understanding 
problem  easier. 

EXAMPLES  FROM  THE  ANIMAL  WORLD 

Animal  world  is  shown  in  figure  6 and  figure  7.  We  will  use  it  first  to  review  some 
basic  hypothesis  and  justification  ideas,  then  we  will  turn  fieetingly  to  an  example 
involving  analogy,  and  finally,  we  will  look  at  the  use  of  standard  transfer  sources  such 
as  people  or  robots. 

Jack  And  Jill  Can  Be  Described  By  Aiiiinal  Metaphors 
Let’s  look  at  a sample  sequence; 

Jack  is  like  a fox. 

Since  fox  has  a very  high  value  for  cleverness,  it  is  concluded  that  Jack  does  too.  The 
context  becomes  intelligence  and  the  use  of  the  fox  as  a metaphor  for  cleverness  will  be 
noted. 


Jill  IS  like  an  elephant. 

Since  an  elephant  has  several  slots,  there  are  several  possibilities,  namely  memory, 
weight,  and  grace.  Good  memory  is  the  winner  though,  since  the  context  is  intelligence. 

Jill  IS  also  like  a cheetah. 

Evidently  Jill  is  fast.  The  context  now  has  to  do  with  motion  properties. 

Robbie  IS  like  an  elephant. 

The  context  now  singles  out  grace  and  transfers  a low  value  because  the  context  now 
has  to  do  with  motion,  not  weight  or  memory. 

Robbie  IS  a robot. 

Robbie  IS  like  an  elephant. 

Robbie  already  has  a grace  property.  The  transfer  must  have  something  to  do  with 
either  weight  or  memory.  Knowing  that  Robbie  is  a robot  helps  because  the  other 
robots  have  values  in  the  memory  slot  but  not  in  the  weight  slot.  Evidently  Robbie  has 
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Figure  6.  The  hierarchical  organization  of  a simple  animal  world. 
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(oeroAMe  anihal 

(AKO  (5VAL  (THINKING-THING))) 
(INSTANCE  (SVAL  (PERSON) 

(OOMESTIC-ANINAL) 

(WILD-ANINAL) 

(FURRY-ANIMAL) 

(EXTINCT-ANIMAL)))) 


(DEFRAME  BIRO 

(AKO  (SVAL  (WILD-ANIMAL))) 

(INSTANCE  (SVAL  (DODO)  (ROBIN)  (SPARROW))) 
(INTELLIGENCE  (SVAL  (VERY-LOW))) 

(SIZE  (SVAL  (VERY-LOW)))) 


(OEFRAME  CAT  (AKO  (SVAL  (DOMESTIC -ANIMAL)  (FURRY-ANIMAL)))  (SIZE  (SVAL  (LOW)))) 


(OEFRAME  CHEETAH  (AKO  (SVAL  (MAMMAL)))  (SPEED  (SVAL  (VERY-HIGH)))) 


(DEFRAME  DODO  (AKO  (SVAL  (BIRO)))  (EXTINCT  (SVAL  (YES)))  (INTELLIGENCE  (SVAL  (LOW)))) 


(OEFRAME  DOG  (AKO  (SVAL  (DOMESTIC-ANIMAL)  (FURRY-ANIMAL)))) 


(OEFRAME  DOMESTIC-ANIMAL 

(AKO  (SVAL  (ANIMAL))) 
(INSTANCE  (SVAL  (DOG)  (CAT))) 
(INTELLIGENCE  (SVAL  (HIGH))) 
(SIZE  (SVAL  (LOW)))) 


(OEFRAME  ELEPHANT 

(AKO  (SVAL  (MAMMAL))) 
(MEMORY  (SVAL  (VERY-HIGH))) 
(GRACE  (SVAL  (VERY-LOW))) 
(WEIGHT  (SVAL  (VERY-HIGH))) 
(SIZE  (SVAL  (VERY-LARGE)))) 


(OEFRAME  FOX 

(AKO  (SVAL  (MAMMAL)  ( FURRY-ANIMAL ) ) ) 
(CLEVERNESS  (SVAL  (VERY-HIGH))) 
(INTELLIGENCE  (SVAL  (HIGH))) 

(SIZE  (SVAL  (LOW)))) 


(OEFRAME  FURRY-ANIMAL 

(AKO  (SVAL  (ANIMAL))) 

(INSTANCE  (SVAL  (DOG)  (CAT)  (FOX)  (MASTAOON))) 
(INTELLIGENCE  (SVAL  (AVERAGE)))) 


(OEFRAME  GAZELLE  (AKO  (SVAL  (MAMMAL)))  (GRACEFULNESS  (SVAL  (HIGH)))  (SIZE  (SVAL  (AVERAGE)))) 


(OEFRAME  HIPPO  (AKO  (SVAL  (MAMMAL)))  (HEIGHT  (SVAL  (HIGH)))  (SIZE  (SVAL  (VERY-LARGE)))) 


Figure  7.  The  frames  that  define  animal  world. 
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(0CFRM1C  JACK  (AKO  (SVAL  (REASON)))) 


(DCFRAHI  JIU  (AKO  (IVAL  (PERSON)))) 


(OEFRAHE  MAMMAL 

(AKO  (SVAL  (WILO-ANIMAL))) 

(INSTANCE  (SVAL  (ELEPHANT)  (CHEETAH)  (FOX)  (GAZELLE)  (HIPPO))) 
(INTELLIGENCE  (SVAL  (VERY-HIGH))) 

(SIZE  (SVAL  (AVERAGE)))) 


(DEFRAME  MASTADON  (AKO  (SVAL  (FURRY-ANIMAL)))) 


(DEFRAME  PERSON 

(AKO  (SVAL  (ANIMAL))) 

(INSTANCE  (SVAL  (JACK)  (JILL))) 
(INTELLIGENCE  (SVAL  (VERY-HIGH))) 
(SIZE  (SVAL  (AVERAGE)))) 


(DEFRAME  R1  (AKO  (SVAL  (ROBOT)))  (MEMORY  (SVAL  (AVERAGE)})) 


(DEFRAHE  R2  (AKO  (SVAL  (ROBOT)))) 


(DEFRAME  R3  (AKO  (SVAL  (ROBOT)))) 


(DEFRAME  ROBBIE  (AKO  (SVAL  (ROBOT)))) 


(DEFRAME  ROBIN  (AKO  (SVAL  (BIRD)))  (SIZE  (SVAL  (VERY-LOH)))) 


(DEFRAME  ROBOT 

(AKO  (SVAL  (THINKING-THING))) 

(INSTANCE  (SVAL  (ROBBIE)  (SUZZIE)  (Rl)  (R2)  (R3))) 
(INTELLIGENCE  (SVAL  (AVERAGE))) 

(MEMORY  (SVAL  (HIGH))) 

(COMMON-SENSE  (SVAL  (AVERAGE))) 

(REASONING'POWER  (SVAL  (VERY-LOW))) 

(VERBOSITY  (SVAL  (LOW))) 

(SIZE  (SVAL  (AVERAGE)))) 


(DEFRAME  SHEEP  (INTELLIGENCE  (SVAL  (LOW)))  (SIZE  (SVAL  (AVERAGE)))) 


Figure  7 (continued).  The  framei  that  define  animal  world. 
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(OEFRAHE  SUZZIE  (AKO  (SVAL  (ROBOT)))) 


(OEFRAME  THING 

(AKO  ($IF-AD0E0  (ADD- INSTANCE))  (SIF-REHOVEO  (REMOVE-INSTANCE))) 
ONSTANCE  (STO-ADO  (( INSTANTIATE-A-FRANE))) 

(SIF-ADDEO  (ADD-AKO)) 

(SIF-REMOVEO  (REMOVE-AKO)) 

(SVAL  (THIHKING-THING)  (OBJECT)  (PROPERTY))) 

(SELF  (SOISCUSS  ((DESCRIBE-FRAME))) 

( BORDER  ( ( SLOTS -TO -BE • INSTANTIATED ) ) ) 

(STYPE) 

(SSLOTS  (t(SETMtNUSl  (HERITAGE -SLOTS  :FRAHE)  (FSLOTS  ’THING)))))) 


(DEFRAME  THINKING-THING  (AKO  (SVAL  (THING)))  (INSTANCE  (SVAL  (ROBOT)  (ANIMAL)))) 


(DEFRAME  WILD-ANIMAL 

(AKO  (SVAL  (ANIMAL))) 

(INSTANCE  (SVAL  (BIRD)  (MAMMAL))) 
(INTELLIGENCE  (SVAL  (AVERAGE))) 
(SIZE  (SVAL  (AVERAGE)))) 


Figure  7 (continued).  The  frames  that  define  animal  world. 
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a good  memory. 

Robbie  is  like  an  elephant. 

The  third  time  around,  only  weight  is  left.  The  context  becomes  size. 

Now  for  the  next  example,  suppose  the  frame  for  robot  has  the  following 
information: 


ROBOT  AKO  THING 

INTELLIGENCE  MEDIUM 
MEMORY  HIGH 

COMMON-SENSE  MEDIUM 
REASONING -POWER  LOW 
VERBOSITY  LOW 


These  properties  make  two  groups:  one  leals  with  intelligence,  memory,  common  sense, 
and  reasoning  power,  all  aspects  of  the  general  notion  of  intelligence,  and  the  other 
deals  with  verbosity,  a dimension  of  personality.  If  X is  unknown,  then  two  transfer 
frames  will  be  proposed  in  response  to  the  following  item: 

X is  like  a robot. 

Suppose  the  transfers  are  allowed  and  the  transfer  frames  are  recorded.  Then  consider 
the  following  sequence: 

Y has  medium  common  sense 

Y IS  like  a robot. 

What  properties  of  robots  are  preferred  for  the  next  transfer?  Intelligence,  common 
sense  and  reasoning  power  could  be  relevant  or  verbosity  might  be  right.  But  since  X's 
memory  is  already  known  to  be  good,  the  choice  is  to  pass  values  through  intelligence, 
common  sense,  and  reasoning  power  since  these  qualities  have  been  transferred  earlier 
as  a group  from  the  robot  concept  along  with  memory  which  already  has  a value  in  Y. 

Transferring  intelligence,  common  sense,  and  reasoning  power  properties  is  the 
preferred  action  because  having  one  fact  about  intelligence  makes  acquiring  more  a 
likely  possibility.  So  far  Y has  no  personality  properties  and  it  would  be  more  risky  to 
transfer  through  the  verbosity  slot. 

A Transformation  May  Be  Specified  Directly  Or  By  Analogy 

Of  course  a value  need  not  slither  through  a transfer  frame  unscathed.  Generally,  it 
may  be  subjected  to  some  sort  of  value  transformation.  VERY-HIGH  becomes 
VERY-LOW  if  T-OPPOSITE  is  the  transformation  in  effect.  MEDIUM  becomes 
HIGH  if  T-MORE  is  the  transformation.  An  APPLE  becomes  FRUIT  by  way  of 
T'CENERALIZE.  Other,  fancier  things  may  be  useful  in  making  metaphors  between 
worlds. 
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The  name  of  the  transformation  may  be  directly  specified,  of  course,  as  in  the 
following  fragment: 

John  IS  the  opposite  of  a fox. 

However,  the  transformation  may  be  given  by  an  analogy; 

Jane  resembles  a fox  in  the  same  way  John  does. 

After  CLEVERNESS  is  found  to  be  the  dimension  in  which  Jane  is  like  a FOX.  it  is  a 
simple  matter  to  use  the  corresponding  transfer  frame  to  test  John  against  FOX  to  find 
that  T-OPPOSITE  is  the  implied  transformation. 

Testing  the  transfer  frame  using  the  analogy  source  and  the  analogy  destination 
also  can  help  filter  out  wrongly  conjectured  transfer  frames  that  may  have  survived  all 
other  filtering  operations.  It  better  be  that  the  same  transformation  applies  to  all  of  the 
slots  in  the  transfer  frame  when  it  is  usee,  to  compare  the  analogy  source  and  analogy 
destination  frames.  Otherwise,  chuck  it  out. 

Notice,  incidentally,  that  the  source,  the  destination,  the  analogy  source,  and  the 
analogy  destination  may  all  be  different.  Notice  also  that  these  four  items,  together 
with  the  transfer  frame  and  the  transformation,  all  may  or  may  not  appear,  giving  a 
total  of  63  combinatorial  possibilities,  the  bulk  of  which  are  probably  absurd. 

Path  Admittance  Helps  Decide  If  .Slots  Can  Be  Filled  From  A Standard  Source 

Suppose  questions  are  asked  about  the  size  and  intelligence  of  a cheetah  in  the  context 
of  the  thinking-thing  information  shown  before  in  the  animal  AKO  tree  and  the  frame 
listing. 

As  mentioned  at  the  close  of  the  jus  ification  and  curiosity  section,  one  way  to  fill 
the  specified  slots  is  to  try  some  well-known  thing  as  a source,  trusting  to  the 
path-admittance  method  to  warn  against  b.id  transfers. 

Given  the  current  information  abcut  animals,  the  path  admisability  has  been 
calculated  for  certain  transfers  as  follows: 


ROBOT-TO-CHEETAH 

INTELLIGENCE-MEDIUM 

.07 

ROBOT-TO-DODO 

INTELLIGENCE-MEDIUM 

.06 

ROBOT-TO-CHEETAH 

SIZE-MEDIUM 

.28 

ROBOT-TO-DODO 

SIZE-MEDIUM 

.14 

It  IS  a bit  hard  to  follow  this  directly  because  the  program  that  calculates  the  numbers  is 
looking  at  a lot  of  nodes  and  it  is  looking  for  the  siblings  of  INTELLIGENCE  and 
SIZE  as  well  as  INTELLIGENCE  and  SIZE  themselves.  But  evidently  transfers  to 
cheetah  are  more  reliable  than  transfers  to  dodo  and  transfers  of  size  are  much  more 
reliable  than  transfers  of  intelligence.  This  makes  sense,  both  in  terms  of  the 
information  the  student  has  and  in  terms  of  what  we  would  expect  a priori. 

Now,  for  further  illumination,  we  can  follow  the  history  of  the  path  admittance 
of  the  robot-to-cheetah  transfers  as  other  information  is  added  to  the  students 
knowledge. 
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For  ROBOT-TO-CHEETAH.  INTELLIGENCE-MEDIUM 


A cheetah  is  a furry  animal. 

.08 

.40 

A sheep  is  a furry  animal. 

.40 

.30 

A dog  is  very  intelligent. 

.30 

.24 

For  ROBOT-TO-CHEETAH,  SIZE-MEDIUM 

A cheetah  is  a furry  animal. 

.28 

.25 

A sheep  is  a furry  animal. 

.25 

.38 

A dog  is  small. 

.38 

.30 

Keeping  the  known  facts  in  mind,  all  of  this  makes  some  sense.  Discovering  that  a 
cheetah  has  a new  connection  into  the  AKO  tree  means  the  shortest  path  to  robot  is 
different.  In  one  case,  this  drives  the  paih  admittance  down  slightly,  in  the  other,  up 
considerably.  The  large  increase  in  the  INTELLIGENCE-MEDIUM  number  reflects 
both  the  shortening  of  path  length  and  the  reduction  in  the  number  of  siblings  known 
to  be  of  the  wrong  intelligence. 

Learning  that  a sheep  is  also  a fur-y  animal  makes  the  size  transfer  look  better 
because  it  gives  the  cheetah  a new  sibling  with  the  same  size  as  the  robot,  but  it  makes 
the  intelligence  transfer  look  worse  because  a sheep  is  also  known  to  be  of  low 
intelligence. 

Finally,  getting  extra  information  about  a dog,  already  known  to  be  a furry 
animal,  makes  both  transfers  path  admittance  go  down  because  both  the  dog's  size  and 
intelligence  differ  from  the  robot’s. 

KEY  ISSUES 

There  has  been  fully  too  little  experiment  with  the  programs  and  the  ideas  in  them  to 
know  how  much  can  be  accomplished.  Iceas  have  been  illustrated,  but  certainly  none 
have  been  solidly  demonstrated.  Many  more  experiments  and  much  larger,  more 
completely  specified  domains  are  necessary  to  do  that.  Still,  there  is  some  preliminary 
hope  that  the  following  principles  may  hold: 

The  principle  of  representational  parsimony.  If  all  sorts  of  knowledge  is  represented 
uniformly,  then  it  is  all  subject  to  the  same  learning  processes.  Since  objects,  properties, 
and  even  Justifications  have  the  same  representation,  all  can  be  learned  about  through 
transfer  frames.  With  respect  to  domain,  any  in  which  the  objects  and  properties  can 
be  described  in  terms  of  frames  is  potentially  a domain  that  learning  using  transfer 
frames  can  address. 

The  principle  of  expanding  competence.  The  more  that  is  known,  the  better  learning 
should  be,  both  in  terms  of  speed  and  accuracy.  Certainly  speed  and  accuracy  should 
increase  with  increasing  knowledge  when  learning  is  by  transfer  frames  since  the  more 
the  student  knows,  the  easier  it  is  for  the  teacher  to  find  lucid  examples  less  subject  to 
misinterpretation. 
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SPECULATIONS 

Metaphor  Traces  Could  Be  Used  To  Find  Substitutes,  Notice  Attributes,  And  Pass 
The  Time 

Look  again  at  the  example  of  the  table  transfer  to  the  cube.  Having  noticed  that  the 
transfer  took  place  while  both  frames  were  observed  to  be  of  medium  sue  and  to  have 
flat  lops,  a lot  of  information  was  recorded  that  might  be  used  as  follows; 

■ Having  made  the  transfer  to  Cl,  idle  time  could  be  spent  seeing  if  relatives  of  Cl 
are  also  like  TABLE  when  compared  through  the  JUSTIFICATION-FRAME, 
JF-l.  If  so,  proceed  to  learn  more  by  making  the  transfer  through  the  transfer 
f rame,  TF-1. 

■ To  find  something  which  would  serve  the  same  purpose  as  a table,  note  that  the 
table’s  purpose  was  transferred  earlier  by  using  the  table’s  recorded  transfer 
frame  and  Justification  frame.  See  if  anything  in  the  physical  vicinity  satisfies 
the  justification  frame.  Index  into  the  frames  in  the  vicinity,  perhaps  using  the 
Justification  frame's  slots. 

■ To  find  something  whose  purpose  is  to  serve  in  writing,  look  into  the  frame  for 
PURPOSE  and  note  that  T.^BLE  has  been  a source  of  metaphors  for  writing. 
Get  the  TRANSFER-FRAME  and  JUSTIFICATION-FRAME  information  from 
TABLE. 

Past  Transfers  Could  Be  Used  In  Generating  Descriptions 

The  simplest  ways  to  generate  transfers  is  to  bounce  back  information  previously 
digested.  The  system  already  leaves  certain  information  behind  to  enable  this. 

First,  when  transfers  are  used  to  transfer  information  into  a concept  later  to  be 
described,  the  source  is  recorded.  If  Sam  was  said  to  be  like  a fox,  it  would  be  easy  to 
say  this; 

Sam  IS  very  intelligent,  like  a fox. 

Just  having  this  would  make  conversation  dull,  full  of  triteness,  but  other  devices  could 
be  used; 

■ When  a frame  has  a slot  filled  with  a VERY-HIGH  or  VERY-LOW  value,  the 
fact  could  be  recorded  in  the  slots  frame  by  instantiation  of  the 
VERY'HIGH-VALUES  or  VERY-LOW-VALUES  slots  using  the  name  of  the 
frame  where  the  exceptional  value  was  recorded.  This  frame  is  then  a possible 
transfer  source. 


Winston 


22 


May  3.  1977 


■ If  a frame  with  an  extreme  value  was  used  as  a source  before,  it  should  be 
particularly  good.  It  is  better  if  it  has  not  served  as  a transfer  source  contributing 
to  the  description  of  the  thin‘<'  to  be  described. 

B If,  in  looking  over  what  is  to  be  said,  there  are  many  possible  sources,  the  transfer 
generator  can  run  its  various  source  possibilities  through  the  filters  using  its  best 
guess  about  what  the  listener  already  knows  about  the  concept  being  described. 
Clearly  the  best  descriptions  are  the  ones  that  allow  rapid  filtering  down  to  the 
correct  transfer  frames.  This  mean*,  the  sources  specified  will  automatically  tend 
to  tell  the  listener  facts  it  is  interested  in  knowing  about  and  stick  to  a context, 
among  other  things. 

■ The  transfer  generator  may  decide  it  is  folly  to  do  the  whole  description  as  a 
chain  of  metaphors.  Instead  it  may  be  better  to  explicitly  specify  a slot  or  a 
context  from  time  to  time. 

■ The  transfer  generator  can  bias  itsjlf  by  choosing  sources  from  either  pleasant 
categories  (fields  and  flowers)  or  unpleasant  ones  (fire  and  brimstone). 

■ As  an  additional  literary  device,  a pointer  into  the  AKO  tree  should  be 
maintained  and  transfer  sources  should  be  selected  from  the  descendants  of  it. 
This  would  tend  to  help  avoid  inelegant  mixing  of  metaphors. 

SUMMARY 

The  path  has  been  involved.  Therefore  it  makes  sense  to  put  the  key  ingredients  on 
display  now,  by  way  of  summary: 

Fraine-like  representation.  A representation  is  a vocabulary  of  symbols  and  a set  of 
conventions  for  arranging  them  to  describe  things.  Obviously  representation  is  a 
central  issue  in  attempts  to  understand  learning,  for  nothing  can  be  learned  unless  there 
is  a representation  that  can  capture  the  new  knowledge  to  be  learned.  Consequently, 
when  a new  and  powerful  representation  n found,  it  is  useful  to  examine  it  with  a view 
toward  addressing  the  competences  exhibited  by  learners. 

Frame  terminology  is  used  here  beciuse  it  is  rich  and  because  the  program  that 
has  been  described  makes  use  of  a sm;.ll  number  of  features  of  FRL,  the  frame 
representation  language  created  by  Goldstein  and  Roberts  [Roberts  and  Goldstein]. 

A simple  property  list  is  a repres'intation  in  which  things  are  described  by 
properties  that  can  assume  values.  The  frame  representation,  invented  by  Minsky  and 
developed  by  Goldstein  and  Roberts,  is  a newer  representation  in  which  the  notion  of 
property  list  is  generalized  [Minsky].  Insteid  of  properties,  there  are  slots.  One  facet  of 
a slot  is  Its  value,  but  unlike  properties,  slots  can  have  many  facets,  not  just  a value 
facet.  Among  these  facets  are  places  where  demon-like  procedures  reside,  waiting  for 
insertions,  deletions,  or  accesses  that  triggei  them  into  taking  a piece  of  the  action. 


For  our  purpose,  the  value  facet  of  a slot  was  the  most  important  one,  and  we 
refered  to  the  value  without  being  more  precise.  Occasionally  it  is  useful  to  know  that 
other  facets  exist,  the  restriction  facet  in  particular.  The  restriction  facet  dictates 
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constraints  on  what  is  allowed  in  the  value  facet.  Finally,  the  values  of  some  slots  have 
comments  associated  with  them  that,  for  e>ample,  can  give  information  about  where  the 
value  came  f rom. 

It  is  also  useful  to  know  chat  frames  are  inherently  arranged  in  hierarchies  so  that 
access  to  one  frame  can  cause  access  to  a second  frame  from  which  the  first  inherits 
information. 

The  destination  frame  is  the  thing  to  be  learned  about.  It  has  slots  that  may 
assume  values.  Often  values  are  supplied  by  a source  frame  that  happens  to  have  a slot 
value  suited  to  the  destination.  Thus  if  it  is  said  that  Robbie  is  sly  as  a fox,  then  fox  is 
the  source,  Robbie  is  the  destination,  cleverness  is  the  slot,  and  very-high  is  the  value. 

Hypothesizing  and  evaluating  transfer  fiaines.  Typically  there  are  many  possible  ways 
the  source  may  be  like  the  destination.  A combination  of  the  known  properties  of  the 
source  and  destination  must  be  used,  perhaps  together  with  context,  to  make  the  correct 
judgement  about  what  is  to  be  learned.  This  is  done  using  a transfer  frame,  a frame 
that  stands  between  the  source  and  the  destination  like  a template  and  determines  the 
information  that  is  transferred.  A key  idea  is  that  these  transfer  frames  can  be 
generated  dynamically  by  the  student  using  a variety  of  common-sense  methods  that 
access  what  is  already  known.  The  good  teacher,  knowing  how  these  methods  work  and 
having  a rough  model  of  what  is  already  known  by  the  student,  can  teach  in  a way  that 
improves  both  the  transfer  rate  and  accuracy. 

Grouping  and  the  typical  instance.  Gioups  are  important.  Croups  of  things  tend  to 
have  the  same  properties,  not  just  a single,  group-defining  property.  Consequently,  an 
abstract  typical  instance  can  be  constructed  for  a group.  The  typical  instance  consists  of 
a number  of  slots  and  slot  values  that  capture  the  essence  of  the  group  it  describes. 
The  notion  of  typical  instance  derives  from  earlier  work  [Winston]  [Davis].  The 
importance  of  typical  instances  in  learning  seems  clearer  now  because  they  are  the  key  to 
several  ideas  for  hypothesizing  and  evaluating  transfer  frames.  The  typical  instance 
descriptions  of  groups  that  the  source  belong  to  help  hypothesize  transfer  frames  and 
those  for  the  destination  help  evaluate  those  transfer  frames. 

Similarly,  groups  of  properties  are  mportant.  The  group  dealing  with  size,  for 
example,  encompasses  weight,  height,  width,  depth,  and  general  size.  If  one  is 
mentioned,  it  helps  to  establish  a context  in  which  the  others  are  expected. 

Dissimilarity  measurement  using  network  path  adniittance.  All  things  reside  in  a 
network  of  a-kind-of,  or  AKO  relations.  For  every  AKO  there  is  a complementary 
IN  ST  ANCE.  Paths  between  the  source  and  destination  in  the  AKO-INSTANCE 
network  offer  some  help  in  deciding  if  a given  transfer  makes  sense:  if  nodes  along  a 
path  show  a strong  tendency  for  slots  like  those  in  a proposed  transfer  to  be  filled  with 
the  the  right  values,  then  the  network  has  high  path  admittance  and  there  is  some 
support  for  the  transfer. 

Interesting  experiments  with  children  suggested  this  idea  [Carey- Block].  If  a 
small  child  is  asked  if  an  animal  has  a ceitain  organ,  it  often  responds  with  a certainty 
proportional  to  the  apparent  overall  similarity  between  the  given  animal  and  the  most 
common  thing  that  the  child  knows  has  the  organ,  a human  typically.  A child  will  be 
less  sure  that  a bird  has  a liver  than  that  a monkey  has  one. 
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Creating  and  using  justification  frames.  Keeping  track  of  specific  properties  that 

legitimize  filling  slots  in  various  ways  is  another  way  to  judge  a proposed  transfer. 

These  specific  slot-value  combinations  are  stored  in  justification  frames.  These 

justification  frames  can  be  accumulated  by  experience.  They  can  also  be  acquired  and 

honed  by  dialogue  with  the  teacher,  just  as  other  frames  can  be. 
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